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A multitude of factors contribute to wliy and liow ronnantic relationships are formed as well 
as whether they ultimately succeed or fail. Drawing on evolutionary models of attraction 
and speech production as well as integrative models of relationship functioning, this review 
argues that paralinguistic cues (more specifically the fundamental frequency of the voice) 
that are initially a strong source of attraction also increase couples' nsk for relationship 
failure. Conceptual similarities and differences between the multiple operationalizations and 
interpretations of vocal fundamental frequency are discussed and guidelines are presented 
for understanding both convergent and non-convergent findings. Implications for clinical 
practice and future research are discussed. 
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INTRODUCTION 

Across cultures, adults rate marriage and long-term monoga- 
mous relationships as the most important ones in their lives 
(Buss, 2005). Despite this, divorce rates in industrialized countries 
rank between 40 to 55% (Hahlweg etal., 2010). A tremendous 
amount of effort has been accordingly devoted to identifying and 
understanding risk factors for relationship dissolution (Gottman 
and Notarius, 2002) with a particular emphasis on identifying 
risk factors that are present from the outset of a relationship 
(Caughlin and Huston, 2002). The polarization model of roman- 
tic relationships (Jacobson and Christensen, 1998; Baucom and 
Atkins, 2013) suggests that one of the most important kinds of 
risk factors originates in variables that are initially associated 
with attraction and desire that later become sources of distress 
and interpersonal friction. A growing body of empirical evidence 
suggests that the fundamental frequency of romantic partner's 
voices may represent precisely this kind of risk. Vocal quali- 
ties that are found to be attractive during the early stages of 
relationship formation are also associated with increased levels 
of dysfunction, increased risk for divorce, and decreased like- 
hhood of benefitting from couple therapy in later stages of a 
relationship. 

Evolutionary models suggest that perceptions of health sta- 
tus and likelihood of reproductive success are one of the primary 
determinants of attractiveness and mate selection. Physical aspects 
of attractiveness like facial symmetry, body height, and the voice 
are indicative of levels of sexual hormones, and thus of the like- 
Uhood of producing healthy offspring. Generally, indicators of 
higher levels of masculinity in males (based on higher levels of 
testosterone) and higher levels of femininity in females (based on 
higher levels of estrogen) are perceived as more attractive and are 



linked to better reproductive success. This leads to a dimorphism 
in attractiveness ratings: males tend to prefer female partners 
with higher fo voices and smaller body sizes while female tend 
to prefer males with lower fo voices and larger body sizes (Puts 
etal., 2012). Though these sex-related differences are important 
determinants of attraction, the polarization model of relationship 
distress suggests that they may become associated with dysfunc- 
tion and increased risk for divorce as relationships mature. A 
change from attraction to distress is often accompanied by a 
change in attribution for the difference itself Differences that 
are seen as enriching and complimentary are often experienced 
as desirable (Aron and Aron, 1997) while differences that are 
seen as short-comings and faults in the other commonly con- 
tribute to a cycle of distress. When differences are seen as faults 
in the other, spouses typically alternate between criticizing and 
blaming one another for the problems in their relationship and 
defending themselves from their partner's attacks (i.e., they engage 
in a strong demand/withdraw cycle of conflict). As this process 
unfolds, the relationship becomes polarized, and the three core 
aspects of interaction, communication, perception, and physiol- 
ogy, become imbalanced (Burman and Margolin, 1992; Gottman, 
1993), conflict ensues, and even routine interaction with the 
spouse becomes highly aversive. The aversive nature of this cycle 
typically results in increasing polarization over time and makes it 
very difficult for spouses to resolve even minor problems. Being 
stuck in the polarization cycle is highly distressing for spouses, 
and high levels of aversive arousal during interaction are one of 
the most robust predictors of risk for relationship distress and 
dissolution (Gottman and Levenson, 1992). Aversive arousal has 
most often been captured via well-established physiological indices 
like heart rate (HR), blood pressure (BP), or skin conductance 
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(Larsen etal., 2008) as well as via endocrine measures like Corti- 
sol, or epinephrine (Robles and Kiecolt-Glaser, 2003; Ditzen et al, 
2011). In addition to well-replicated findings based on physio- 
logical measures, a growing body of evidence shows that higher 
levels of vocally encoded emotional arousal, which are reflected in 
higher levels of fo, are similarly related to increased risk for a wide 
range of negative relationship outcomes. 

FUNDAMENTAL FREQUENCY (fo) 

As a mathematical quantity, fundamental frequency (fo) refers to 
the lowest frequency harmonic of the speech sound wave. This 
frequency is biologically determined by the pattern of vibration 
created by the vocal folds during phonation, the phase of speech 
production where the outward flow of air from the lungs is reg- 
ulated by the larynx. Higher rates of opening and closing of the 
vocal folds across the glottis are associated with higher fo values 
(measured in cycles per second or Hertz [Hz]). Perceptually, fo is 
highly correlated with pitch where higher fo values correspond to 
higher pitch (Juslin and Scherer, 2005). Fq can be easily calculated 
using Praat, a freely available, Windows-based software package 
(Boersma and Weenink, 2013; www.praat.org), to analyze existing 
audio recordings of speech in adequate quality. Fo is assessed con- 
tinuously during human speech and can change rapidly. Different 
parameters like mean fo, fo range, minimum fo, or maximum fo 
can be calculated and used in empirical research. All of those can 
be calculated either at a very small scale (such as for each talk 
turn) or averaged across over the entirety of a conversation. As 
is true of much behavioral research, there is greater agreement 
about how to calculate fo than there is about how to understand 
what fo represents. Fq has been variously interpreted as an index of 
vocally encoded emotional arousal (see Juslin and Scherer, 2005 for 
a review; Weusthoff et al, 2013) and dominance (Puts et al., 2006; 
Borkowska and Pawlowski, 2011), and additional work demon- 
strates that fo correlates with other factors such as age and pubertal 
(HoUien and Shipp, 1972; Brown etal., 1991; HoUien etal, 1994, 
1997), phonemic and syntactic structure of speech (Whalen and 
Levitt, 1995). Recent work examining fo during social interaction 
provides a framework for integrating the various interpretations of 
fo. In Weusthoff etal.'s 2013 examination of simultaneous associ- 
ations between fo, biological sex, physiological indices of arousal, 
and social behavior, biological sex was the largest predictor of 
individual differences in fo while physiological arousal and social 
behaviors were the best predictors of variance in fo attributable to 
a specific social interaction. These results suggest that fo can be 
understood as conveying information about both traits (such as 
biological sex) and states (momentary physiological arousal and 
social behavior) and highlights the need for careful analysis of fo 
to allow for specific interpretations. 

METHOD OF REVIEW 

In contrast to the nascent body of research examining fo during 
marital interactions,fo has been intensively studied using different 
research paradigms across multiple disciplines (e.g., single conver- 
sations recorded during naturally occurring stressful events like 
the New York City blackout in 1977, Streeter et al., 1982, or emo- 
tion portrayals by professional actors, Banse and Scherer, 1996; 
see Scherer, 2003 for a review). For example, a key- word based 



search on SCOPUS (using the search terms "social interaction" 
AND "verbal" OR "non-verbal" in "Keywords, Abstract, or Title") 
yielded more than 2000 results. Here, we focus on research with 
a specific focus on relationship formation and maintenance. The 
body of work on fo and attraction is robust and well-developed. 
Thus, we seek to provide a representative sampling of the main 
findings in this area. In contrast, there are many fewer studies 
of naturalistic social interaction in general much less about mar- 
ital interaction specifically. As others have noted, studies using 
naturalistic vocal expressions of affect with sufficient intensity of 
the emotions displayed are needed in vocal expression research 
in order to obtain speech that offers sufficient levels of "ecologi- 
cal validity" (Juslin and Scherer, 2005). Communication in close 
relationships seems to fulfill these criterions but research has only 
recently begun to take fo into account here. 

Research with human participants can be split up in two farther 
groups: whether interlocutors had some form of relationship with 
each other before interacting, or not. In the case of pre-existing 
relationship, four main areas of relationships have been covered: 
the ones between parents and infants (see Irwin, 2003 for a review), 
between psychotherapists and clients (see Greenberg and Pascual- 
Leone, 2006 for a review), between physicians and patients (see 
Hassan et al., 2007; Hulsman et al, 20 1 1 for a review), and between 
spouses in intimate relationship. Independent of targets, fo has 
been found to be associated with arousal. As it it beyond the scope 
of this review to cover this all, the interested reader is referred to 
the according reviews. In couples interactions, however, the role 
of vocally encoded emotional arousal has not been investigated 
closely. This review aims at looking into fo and its associations 
with couple functioning more closely. The authors identified a 
total number of JV = 5 studies for this task. All analyses were 
based on dyadic communication settings (conflict discussions) 
between spouses from long-term or married heterosexual couples. 
Instructions for the tasks were standardized, and recordings of 
the interactions were videotaped during assessment sessions con- 
ducted in research laboratories. Participants were either English 
(Baucom etal., 2009, 2011), or German (Baucom etal., 2012b; 
Kliem etal, 2012; Weusthoff etal, 2013) native speakers (for a 
detailed description of the studies included in this review, please 
see Table 1). 

As also described in Weusthoff etal. (2013), fo was obtained 
continuously from the speech of each person during the prob- 
lem discussion of the assessment point(s). Conversations were 
segmented per speaker using audio editing programs like Adobe 
Premier Pro, resulting in data per speaker containing only con- 
versation parts during which he or she talked separately without 
any other human, or background sound being present. Bandpass 
filtering was applied to all voice samples prior to further analyses 
(calculating fo values) in order to restrict fo values to the normal 
range of emotional adult speech. The typical range for male speak- 
ers is 75-150Hz, for female speakers 150-300Hz, though wider 
limits can be observed during highly aroused emotional states 
(Owren and Bachorowski, 2007). Common fo mean values are 
around 225 Hz for women, and about 120 Hz for men. Across the 
lifespan, women's mean fo decreases while male speakers' mean 
fo scores also decrease first, but start to rise again. Changes in 
adults are most prominent for both sexes between age 50 and 
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60, with women additionally experiencing hormonal influences 
associated with the onset of the menopause (HoUien and Shipp, 
1972; Brown etal, 1991;Hollien etal, 1997). Though it is possible 
for human speakers to generate sound outside the typical filtering 
limits of 75 and 300 Hz, fo scores outside of this range are very likely 
resulting from background machine or electronic noise. Mini- 
mum, maximum, and mean fo values were generated separately 
for each spouse by analyzing their respective segmented audio 
recordings using Praat, a free multiple platform program (Boersma 
and Weenink, 2013). Fq range was calculated separately for each 
partner at each assessment by subtracting the partner's averaged 
minimum fo from the partner's averaged maximum fo across the 
whole conversation. In Baucom etal.'s (2011) study, fo mean was 
derived from raw fo scores across the whole conversation. For a 
more detailed description of potential fo range indices, please see 
Table 1 in Baucom et al. (2012a). 

fo IN INTIMATE RELATIONSHIP RESEARCH 

Due to three reasons, it is important to farther investigate the 
role of fo in intimate relationships. One, relationship deteriora- 
tion and/or separation are associated with tremendous costs for 
affected individuals and society as a whole, both concurrently 
and in the long run. Dysfunctional relationships lead to poorer 
mental and physical health in spouses (Kiecolt-Glaser and New- 
ton, 2001), a higher likelihood for divorce and reduced levels of 
social support (Amato, 2010). Due to higher levels of unhealthy 
behaviors like smoking, even a decreased life expectancy (Lar- 
son and Halfon, 2013) in affected children has been observed. 
Second, couple-relationship education programs (CRE) aim at 
reducing the risk of relationship dissolution by teaching com- 
munication skills and ways of dealing with aspects of couple's 
emotional lives, e.g., EPL - Ein Partnerschaftliches Lernprogramm 
("A Learning Program for Couples"; Kaiser etal., 1998), espe- 
cially during conflict discussions. These programs are known to 
be highly effective in doing so both in the short and in the long 
run (Hawkins etal., 2008; Hahlweg and Richter, 2010). However, 
the mechanisms of change in CRE and how emotional arousal 
affects the work in and outcome of CRE are stiU unknown. Com- 
munication is assumed to be the central mechanism of change, 
however, empirical studies using the paradigm of couples hav- 
ing videotaped conflict discussions that are later rated and / or 
analyzed with regard to different aspects of behavior have not con- 
sistently supported this supposition. One reason for this might be 
the rather simplistic ways of operationalizing a complex behav- 
ior like human communication as single elements like positive, 
or negative utterances (Christensen, 2010). Therefore, research 
should consider several aspects of communication simultane- 
ously in a multi-channel approach (Gottman and Notarius, 2002). 
Third, though emotional arousal is known to play a crucial role 
in relationship dissolution (Gottman and Levenson, 1992), the 
specific pathways and interactions with other important vari- 
ables like communication behavior, or relationship satisfaction 
are still unknown (Weusthoff etal., 2013). As future research in 
this field should focus on how to "modify emotion- driven, dys- 
functional, and destructive interactional behavior," and to "elicit 
avoided, emotion-based" behavior (Christensen, 2010, pp. 36-37), 
there is additional need for an objective tool for detecting and 



explaining emotional arousal in the context of couple interactions 
and intimate relationships. 

During courtship, fo is one of the evolutionary most important 
signals for non-visual gender discrimination (which is important 
for successful identification of potential mates; lunger et al., 2013), 
and for judging the attractiveness of a potential spouse (Borkowska 
and Pawlowski, 2011). Very masculine voices in males (perceived 
as low voice pitch, and thus low fo), and very feminine voices in 
females (perceived as high voice pitch, and thus high fo) are per- 
ceived as particularly attractive (DeBruine et al., 2006; Feinberg 
etal., 2006, 2012; Borkowska and Pawlowski, 2011; O'Connor 
etal., 2012). These perceptions are attenuated in both men and 
women during high-fertile phases of the female menstrual cycle 
(Puts, 2005; Pipitone and Gallup, 2008; Hodges-Simeon etal, 
2010). High vocal masculinity in males are linked to high levels of 
testosterone, and associated with a higher likelihood for produc- 
ing healthy offspring (O'Connor et al., 2012). However, in the long 
run, males with higher levels of vocal masculinity report a larger 
number of different sexual partners, and are less likely to engage in 
relationship maintaining and parental behaviors (O'Connor et al, 
2011, 2012). It also seems plausible that fo during relationship 
initiation stages displays high levels of arousal that is taken by 
partner as an indicator for higher engagement in a relationship, 
also hinting at a process similar to polarization. 

During maintenance, less research on fo's role has been con- 
ducted so far. Fo has most often been studied in couples' conflict 
interactions where it seems to index levels of emotional arousal. 
Fo during couple conflict has been demonstrated to be signifi- 
cantly positively associated with multiple cardiac, and endocrine 
indices of arousal, and linked to more negative and less positive 
communication behavior. Fq is thought to simultaneously dis- 
play autonomic physiological as well as socially learned reactions 
in one signal (Weusthoff etal., 2013). Furthermore, it has been 
shown that spouses influence each other in their levels of arousal. 
If one partner is highly aroused, it is becomes more difficult for 
the other one to maintain on a functional level of arousal, thus 
leading to polarization processes with regard to fo (Baucom and 
Atkins, 2013). 

As also noted elsewhere (Weusthoff etal., 2013), fo offers a 
number of methodological and conceptual benefits with regard 
to being further used in research on romantic relationships. Its 
non-invasive nature and lack of need for additional equipment 
(only a good audio recording device is needed) makes the assess- 
ment of fo an excellent candidate for data collection in situations 
like conflict interactions that are most often videotaped, or for 
post-hoc analyses (e.g., spousal discussions, or therapy sessions). 
Additionally, fo can be computed and analyzed from the video- 
tapes even after conducting a study in which it was not of primary 
interest. Furthermore, fo is more closely associated with individual 
psychological than physical distress (johannes etal., 2007), mak- 
ing it more sensitive to detect changes in psychological load and 
less prone to artifacts that influence physiological measures of 
arousal like HR or skin conductance (e.g., movements, Sloan and 
Kring, 2007). Being based on physical properties of speech, fo 
can be analyzed objectively and independent of the researcher's 
native language and culture (Weusthoff et al., 2013). Perhaps most 
importantly, fo is directly involved in and available during the 
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process of communication and can be perceived by a listener 
(Baucom etal., 2011). There is good evidence that partners in 
communication directly respond to each other's fo without being 
aware of this fact (Gregory and Webster, 1996). Similar to cardio- 
vascular indices of arousal (e.g., HR), fo seems to encode aspects 
of emotional arousal that are part of an autonomic process not 
entirely controllable by a speaker, similar to conditioned emotional 
responses (Kliem etal., 2012). 

Merging these theories, expression of emotions (emotional 
expression) toward one's spouse can be considered as a central 
part of well-functioning intimate relationships, the role of under- 
lying emotional arousal, however, seems to be somewhat different. 
Endocrine and psychophysiological indices of heightened levels of 
emotional arousal have consistently been linked to unwanted rela- 
tionship outcomes like dysfunctional communication behavior, 
higher risk for divorce and separation, or lower levels of relation- 
ship satisfaction (Gottman and Levenson, 1992; Kiecolt-Glaser 
and Newton, 2001; Gottman and Notarius, 2002). As emotions 
and their underlying processes like arousal are essential parts of 
social interactions (luslin and Laukka, 2003) and can influence 
them heavily (Scherer, 2005), it makes them important informa- 
tion not only for researchers in the social interaction area but also 
for people's everyday lives. 

STUDIES REVIEWED 
SUMMARY OF FINDINGS 

Across studies, languages, and gender, fo has been treated as 
an index of emotional arousal. Like other indices fo has been 
associated with a number of different negative aspects of cou- 
ple functioning both concurrently and in the long run. Significant 
associations have been found in conflict interactions of distressed 
couples, either participating in couples therapy, or in CRE. More 
specifically, higher levels of a person's physiological indicators of 
emotional arousal like HR, BP, or salivary Cortisol were associated 
with higher levels of one's own fo. Higher levels of a spouse's fo 
have also been linked to higher levels of negative communication 
behavior, both observed and self-reported, and to lower levels of 
observed non-verbal positive communication behavior. In highly 
distressed couples, the likelihood for long-term success of couple 
therapy (being in the recovered range 2 years after treatment ter- 
mination) was higher when wives' displayed lower levels of fo in a 
conflict discussion prior to treatment. Elevated levels of fo in con- 
flict discussions prior to participation in CRE also lead to a higher 
likelihood for separation and divorce, and a smaller number of 
communication skills remembered 1 1 years after the CRE. 

DISCUSSION 

This review aimed at examining research on vocally encoded 
emotional arousal, namely fo, in close personal relationship 
communication (interaction between spouses). Significant associ- 
ations were found with psychophysiological and endocrine indices 
of emotional arousal, observed and self-reported communication 
behavior, and relationship functioning (stability and satisfaction). 
Furthermore, fo has been found to be related to skills remem- 
brance in CRE. Significant results were found for concurrent as 
well as longitudinal links between fo and variables of interest, and 
for a time frame as long as II years. Significant gender differences 



emerged for fo range's association with some variables of interest 
but not for fo mean's. 

FINDINGS CONSIDERED FROM AN EVOLUTIONARY PERSPECTIVE 

Across studies, fo was analyzed as an index of emotional arousal 
thought to display information about the internal state of the 
speaker to an interaction partner via the human voice. Stable 
associations between fo and different forms of communica- 
tion behavior (positive, negative, observed, self-reported, verbal, 
and non-verbal) emerged across studies. Social signaling theo- 
ries assume evolutionary reasons for this: In order to heighten 
chances of survival in ancient hunter-gatherer societies, emo- 
tions were adaptive developments helping to avoid danger and 
to cooperate with others (Buss, 2005). Vocally encoded emo- 
tional arousal enables individuals to communicate emotions 
non-verbally from one person to another, independent of words 
and language. Vocal communication of emotion is more likely to 
appear in goal-relevant behavior (Juslin and Laukka, 2003). Fq 
scores indexing vocally encoded emotional arousal should there- 
fore be higher during goal-relevant behavior. What is considered 
as goal-relevant behavior depends on situational aspects: Cou- 
ple and family conflict is conceptually thought of as a chronic 
stressor (Doss etal., 2004) where often a wide range of nega- 
tive communication behavior (like DW) is displayed in order to 
achieve change in one's spouse (Christensen et al., 2006) and high 
levels of emotional arousal are displayed via multiple channels 
(Kiecolt-Glaser and Newton, 2001). Positive associations between 
fo and these variables as found in the reviewed studies inves- 
tigating conflict could be indexing goal-achieving strategies by 
spouses. 

With regard to the theoretical foundation of emotional arousal, 
coordinated responses in multiple physiological systems stem from 
the same source, namely the cognitive appraisal in a given situation 
and the following changes in autonomous and somatic nervous 
system activity (Scherer, 2009). These changes were assessed via 
multiple channel of arousal in Weusthoff etal.'s 2013 study, with 
the associations between fo and psychophysiological indices of 
arousal being in expected directions. With the periaqueductal 
grey (PAG), a brain-stem based areal (and thus an evolutionary 
old one) in humans is involved in both vocal and cardiovascu- 
lar responses to different kinds of stress. The PAG is thought 
to integrate emotional aspect of stress responses into the auto- 
nomic nervous system responses in different channels (Linnman 
etal., 2012). These findings suggest that fo as an index of emo- 
tional arousal, and thus the intensity of the emotional reaction 
(Weusthoff etal., 2013), are influenced by both basic biological 
processes as well as by sociaUy learned communication behaviors 
stemming from a similar evolutionary basis (Juslin and Laukka, 
2003). 

Different brain regions are found to be involved in processing 
male and female voices (Sokhi etal., 2005), and discriminatory 
performance for human voices was better for opposite-sex stim- 
uli than for same sex-stimuli (faster identification; Junger etal, 
2013). Fo seems to enable a stronger attendance toward stimuli 
higher in social significance like potential mates (Feinberg, 2008), 
which could be considered as an important aspect of relationship 
functioning not only in initiation but also in maintaining phases. 
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The studies covered in this review seem to foster this interpreta- 
tion: during later phases of relationship and especially in "rough" 
times, spouses seem to especially use fo as a non-verbal signal in 
relationship-relevant information like expression of distress. 

FINDINGS WITH REGARD TO GENDER DIFFERENCES 

Across age and interaction settings, gender differences were found 
in a number of studies. Biologically speaking, fo is based on vocal 
cord length and tension. Given the physiological differences in 
body size and thus, throat length between men and women, gen- 
der differences in fo emerge after puberty with females on average 
having significantly higher fo scores than males (Titze, 1989). Two 
studies in this review (Baucom et al., 2012b; Weusthoff et al, 2013) 
explicitly investigated gender differences and as expected found 
higher fo range scores for female than for male speakers. Except 
for Baucom etal. (2011) and Weusthoff etal. (2013), all studies 
investigating adult speakers included in this review found vari- 
ables of interest to have significant associations with fo range for 
female speakers only, or found different associations for men and 
women. Given that female partners are more likely to seek help 
with regard to problems and/or conflict in close personal relation- 
ships (O'Brien, 1988) it seems also plausible that gender differences 
could stem from different goals and resulting goal-oriented behav- 
ior in males and females (especially as no associations being 
significant for men only emerged). Finding the best partner to 
produce healthy offspring seems to be an important goal during 
selection of a potential mate and sex partner during courtship. 
Significant associations between perceived, and self-rated attrac- 
tiveness and health, and fo for both sexes during sexual selection 
(though due to endocrine reasons in different directions, with 
higher attractiveness and perceived health being associated to fo 
scores in women and lower fo scores in men; Puts, 2010) also hint 
at fo being an important aspect in goal-relevant behavior. 

LIMITATIONS 

The work reviewed in this manuscript has taken into account only 
three indicators of vocally encoded emotional arousal out of a 
wide variety of potential ones (Juslin and Scherer, 2005, pp. 103- 
104): fo range, fo mean, and fo -time-to-peak as a time-varying 
aspect of fo. This has happened for a number of reasons. As ear- 
lier work on vocally encoded emotional arousal has focused on fo 
mean in a number of different research settings (i.e., fo in speech 
of psychiatric patients, Tolkmitt et al., 1982; or emotion portrayal 
by professional actors, Banse and Scherer, 1996), it was chosen 
as the parameter of interest in the first study conducted on fo 
in close personal relationships (Baucom etal., 2011) in order to 
enable comparisons to these published empirical findings. How- 
ever, among the different fo indices that can be calculated fo range 
seems to bear a number of advantages in research on close per- 
sonal relationships (see Juslin and Scherer, 2005 for a review), and 
was therefore chosen for the studies conducted later, fg range is 
considered to be the cleanest vocal indicator of emotional arousal, 
and to depict the biggest amount of information on emotional 
arousal (Juslin and Scherer, 2005; Busso et al, 2009). Furthermore, 
it facilitates the interpretation of gender differences as it adjusts 
for potential individual differences by way of its calculation (sub- 
tracting the individual's minimum fo score from the individual's 



fo maximum score; Weusthoff et al., 2013). Nonetheless, it is pos- 
sible that other indicators of vocally emotional arousal (e.g., jitter) 
might contain additional information on the nature of emotional 
arousal in close personal relationship communication. 

Though biological reasons for differences in male and female 
voices are well-documented and have been discussed in this review, 
it remains unclear why expected sex differences for fo indices 
and their associations with different variables of interest did not 
emerge consistently across studies. Particularly, none of the studies 
found significant gender differences in the associations between 
fo mean (compared to studies using fo range) and variables of 
interest. Future research should regard gender as a covariate in fo 
research in close personal relationship but more detailed inves- 
tigations on the details of gender differences in fo indices are 
needed. 

Investigating emotional arousal in human speech using fo 
includes the drawback of participants being able to willingly influ- 
ence and control fo to a certain degree. Empirical evidence has 
shown that conscious manipulations can result in both elevated 
and lowered pitch levels that are identifiable by a communication 
partner (Kuenzel, 2000). However, given the arousing, stressful, 
and cognitively demanding situation of couple conflict, it is quite 
unlikely that participants in the studies reviewed made use of this 
hypothetical possibility. 

IMPLICATIONS FOR CLINICAL WORK AND FUTURE RESEARCH 

As fo seems to be able to display an individual's internal levels 
of emotional arousal, other dyadic settings in which emotional 
arousal is considered to be important information could also 
benefit from research on vocally encoded arousal. During social 
support interactions between spouses, showing the interaction 
partner positive and helpful behaviors in order to help him or 
her to deal with stressful situations is the main goal. Social sup- 
port is considered to consist of different positive and helpful 
behaviors being observable in communication between interac- 
tion partners leading to changes in individual cardiovascular, and 
endocrine functioning which influences individual health out- 
comes. Consistent with this model, lower levels of social support 
behavior have been empirically linked to higher levels of physio- 
logical indices of arousal (most often BP), and to poorer health 
outcomes in affected individuals (Uchino etal., 1996). Given the 
associations between BP, and HR with fo in couple conflict, vocally 
encoded emotional arousal also seems to be important during 
social support interactions between spouses. However, in com- 
parison to conflict discussions, more positive and less emotionally 
arousing behaviors seem be goal-relevant in social support and 
should therefore be displayed more often between interaction part- 
ners (Verhoftstadt etal., 2005). Links between fo and variables 
of interest in social support behavior could therefore be of dif- 
ferent direction and magnitude than the ones found in conflict 
discussions. 

Communication Accomodation Theory (CAT; Giles and Cou- 
pland, 1991) describes changes in partners' communication styles 
during a conversation leading to more, or less similitarity between 
the partners (convergence and divergence). Convergence can 
occur on different levels (e.g., pitch pattern, speech rate, or 
emotional expression), and has empirically been shown to occur in 
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various dyads. Convergence seems to happen somewhat naturally 
but is associated with different outcomes depending on the inter- 
action setting and other external factors in intimate relationships. 
Convergence seems to be a beneficial process during interactions 
asking for social support between partners. For example, high lev- 
els of emotional convergence (meaning similarity in emotional 
expressions between spouses) while sharing emotional events of 
one's day are related to higher levels of concurrent and longitudinal 
relationship satisfaction, and higher relationship stability (Ander- 
son etal., 2003). During couple conflict, however, convergence 
in emotional expression and behavior like the DW pattern have 
negative impacts on relationship functioning and outcomes (Lee 
etal., 2012; Baucom and Atkins, 2013). A closer look at poten- 
tial reasons for these differential effects of convergence in couples' 
communication could be a fruitful avenue for future research. Are 
the associations due to levels of relationship satisfaction (happy vs. 
unhappy couples), do they depend on the context of the interac- 
tion (social support vs. conflict), or do both factors and/or a third 
one contribute? 

Vocal characteristics between talkshow host and guests have 
also been demonstrated to converge across the course of an inter- 
view. However, across different dyads, status and/or power seem 
to be an important factor influencing convergence: The conver- 
sation partner lower in status and power (having a greater "need 
for social approval"; Giles and Coupland, 1991, p. 73) exhibits 
more change in vocal and emotional expressions, moving in 



the direction of the more powerful partner (Gregory and Webster, 
1996). 

Power processes are also influential in couples communica- 
tion (Baucom etal., 2011), and are interdependently linked to 
each other. For example, fo in distressed spouses' problem-solving 
discussions is known to converge in terms of magnitude: if one 
spouse is highly aroused, the other partner is also more likely to 
be highly aroused. Furthermore, this covariation seems to lead to 
problems in partner's ability to regulate their own state of arousal 
to a comfortable level (Baucom and Atkins, 2013). 

Given these findings, it seems likely that spouses' fo scores could 
also be associated with each other across speakers. Gottman and 
Notarius (2002, p.l85) explicitly state that there is a"need for con- 
tinued focus on sequences or patterns of interaction" in order to 
identify beneficial and harmful aspects of spousal communication. 
Sequential analyses of fo could shed light on presence, magnitude, 
and direction of interdependent and cyclical aspects of emotional 
arousal in couple communication. 
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